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Abstract 

In the present paper we discuss problems concerning evolutions of densities related 
to Ito diffusions in the framework of the statistical exponential manifold. We develop a 
rigorous approach to the problem, and we particularize it to the orthogonal projection of 
the evolution of the density of a diffusion process onto a finite dimensional exponential 
manifold. It has been shown by D. Brigo (1996) that the projected evolution can always be 
interpreted as the evolution of the density of a different diffusion process. We give also a 
compactness result when the dimension of the exponential family increases, as a first step 
towards a convergence result to be investigated in the future. The infinite dimensional 
exponential manifold structure introduced by G. Pistone and C. Sempi is used and some 
examples are given. 
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1 Introduction 

This paper moves both from the differential geometric approach to nonhnear filtering as devel- 
oped by Brigo, Hanzon and LeGland [9] and from the rigorous approach to the construction 
of a differential geometric structure in the infinite dimensional space of probability measures 
given in Pistone and Sempi [24], see also Pistone and Rogantin [23]. The solution of the filter- 
ing problem is a stochastic PDE which can be seen as a generalization of the Fokker-Planck 
equation (FPE) expressing the density of a diffusion process. This filtering equation is called 
the Kushner-Stratonovich equation. In [9] the Fisher metric is used to project the Kushner- 
Stratonovich equation onto a finite dimensional exponential manifold of probability densities. 
This method can be used also for the simpler FPE. In the present paper we discuss the geo- 
metric approach to problems concerning finite dimensionality of densities related to stochastic 
differential equations given by Ito diffusions. Part of these results were already given in [13], 
and we shortly present them here in the framework of Pistone and Sempi [24] . This approach is 
different from the one adopted for example in Brigo, Hanzon and LeGland [9] or in Brigo [13], 
since it uses the exponential manifold structure rather than the L2 derivation. The L2 structure 
is obtained by mapping densities into their square roots. We show that this map yields a regular 
C°° parametrization but it is not a chart for the infinite dimensional manifold of densities. In 
the present paper we consider the projection in Fisher metric of the density-evolution of a diffu- 
sion process onto an exponential manifold. Such projection is obtained via the projected FPE. 
We examine the projected density-evolution and discuss problems related to finite dimension- 
ality, giving some examples. We recall from Brigo [13] that the projected density-evolution can 
always be interpreted as the density-evolution of a different diffusion process. We conclude by 
giving a first step for future investigations on the following convergence problem: is it possible 
to prove that the projected density converges to the original one when the dimension of the 
exponential manifold on which we project tends to infinity? 

2 The exponential statistical manifold of positive prob- 
ability densities 

In the present section we give a summary of the construction of the non-parametric exponential 
statistical manifold as developed in [24] and [23] . In those papers it is shown that the definition 
of statistical manifold as introduced by Dawid, Efron, Amari and others, and systematically 
presented in [19], can be given in a non parametric setting using the framework of the theory 
of manifolds modeled on Banach spaces, as introduced for example in Lang, [18]. 

We consider a measure space {X, X, jj), where // is a reference measure, and the set M.{X, X, ji) 
of the a.s. strictly positive densities w. r. t. some measure equivalent to We define on the 
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set M.{X, X, fj) a topology such that M.{X, X, fj,) is an Hausdorff space (i.e. points can be 
separated by open sets). Then we shall construct a covering of A4{X, X, jj) with open sets Up, 
p e Up, p e A4{X, X , fi), and a corresponding family of Banach spaces Bp, with norms || • ||p, 
p G Ai{X, X, fi), such that each density q & Up is represented by a coordinate Sp{q) e Bp. 
We shall use the notations 



U„ 



VpCBp 



Vp^UpGM{X,X,fi) 



(1) 
(2) 



to denote respectively the charts, i.e. the mappings from points to coordinates, and the patches, 
i.e. the mappings from coordinates to points. 

Following the use in differential geometry, we say that {{Up,Sp) : p G A4{X, X , fi)} is an 
atlas if all the space is covered by charts; if moreover each of the change of coordinates 



Sp2 O Bp^ . Sp^ {Up, nUp,) 



{Up, nUp^) 



is a diffeomorphism of some regularity between open sets, the atlas has that regularity. In such 
a case the atlas, augmented with all the compatible charts, defines the manifold, see Lang [18] . 

In our case we shall introduce a very special manifold, such that the change of coordinates 
are actually affine functions — i.e. they differ from a linear function by a constant — , but we 
will keep a weaker regularity, namely the C°°-regularity (differentiability of any order) for 
compatible charts. 

We shall denote by Ep.^ [•] the expectation w.r.t. the probability measure p • /i (where p • 
li{dx) — p{x)ii{dx)); if there is no ambiguity we will use the notation Ep [•]. 

First we define the topology as follows. For simplicity we give only the definition of con- 
vergence of sequences. The sequence (pn)neJV in Ai{X,X,ii) is e-convergent (exponentially 
convergent) to p if {pnjneiN tends to p in yU-probability as n — > cxo and moreover the sequences 
{Pn/p)neiN and {p/pn)neiN are eventually bounded in each L"{p), a > 1, i.e. 



Va > 1 lim sup Ep 



Pn 



< +00, lim sup Ep 




< +00. 



Now we shall introduce the Banach spaces on which the statistical manifold is modeled. We 
give a definition that shows how they are connected with well-known statistical objects. For 
each density p G M{X, X, /i), the Cramer class at p is the set of all random variables u on X 
such that the moment generating function 

Up{t) = J e^ydn = Ep [e*"] , teR 

is finite in a neighborhood of the origin 0. If moreover the expectation of u is zero (the previous 
condition implies the existence of a finite expectation), then we shall call the set the centered 
Cramer class at p. 
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The centered Cramer class at p is a vector space, and it shall be denoted by Bp, i.e. 
Bp = {u e L\p- 1^) : e DomiupY, Ep [u] = 0}. 
It is a Banach space for the norm defined by: 

< l} (3) 

In the previous formula the function x i— > cosh(x) — 1 is a convex function that plays in the 
theory of the spaces Bp the same role as the function x i— > |x|°/q; in the theory of Lebesgue 
spaces q; > 1. We cite [17] and [25] as general references. 

We will denote by *Bp the Banach space of centered random variables of the so called 
X log x-class. A random variable u belongs to the x log x-class *Bp if and only if it is centered 
and (l + u) log(l + u) is (p • /i)-integrable. 

Now we give some details about the Banach spaces Bp and *Bp which will be useful in the 
construction of the statistical manifold. 

Proposition 1 1. The dual space of the Banach space *Bp is isomorphic to Bp, i.e. if T is 
a continuous linear operator on *Bp then there exists a unique u E Bp such that T{k) = 
Ep [ku], k e *Bp; that is {*Bpy 3T ^ue B. 

2. All the elements k in *Bp are identified with an element of the dual space B* of Bp with 
the identification S{u) = Ep [ku], hut *Bp is strictly smaller than B* unless the sample 
space has a finite number of atoms. 

3. Denoting with sub-0 the spaces of centered random variables, the following continuous 
inclusions hold true: 

L^ip ■pi)cBpC f]L^{p-pi)cL^{p-pi)c *Bp C B;. 

The patches of the atlas will be defined on the open ball of radius 1: 

Vp = {n G Bp : < l}; 

remark that the condition \\u\\p < 1 is equivalent to the existence of an a > 1 such that 
Ep [cosh(Q;w) — 1] < 1, which in turn implies Ep [e"] < 4, see Prop. 2 below. 

The moment generating functional Gp : L('^°**^ ~^)(p • /i) Rj^ — [0, +oo] is defined by 

Gp{u) = Ep [el . 

The cumulant generating functional Kp : Bp ^ [0, +oo] is defined by 

Kp{u) = log Gp{u). 



\u\ 



inf 



< r : 



E. 



cosh 



- 1 
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Proposition 2 (Properties of the CGF) The cumulant generating functional Kp has proper 
domain Dom{Gp) fl Bp. IfVp denotes a subset of the proper domain then Kp satisfies the fol- 
lowing properties 



1. Kp is at 0, otherwise is strictly positive; is convex and infinitely Frechet differentiable 
on Vp. The value at of the differential of order n is the value of the n-th cumulant under 
p of the random variable u. 



2. \/u e Vp, g = e 



u—Kp{u) 



p is a probability density in Ai{X, X, fi) and the value of the n-th 



differential at u in the direction v of Kp is the n-th cumulant of v under q: 



Jn 

B-Kp{u) — logE, 



t=0 



3. In particular | — 1 e *Bp and 



DKp{u) V = E,[v] = Ep 



Kp{u) V1V2 = Eq [viV2\ - Eg [vi] Eg [^2] 



(4) 



Using the definitions introduced so far, it is possible to give a definition of the non-parametric 
exponential model as follows. For each p in M.{X, X, /i) the maximal exponential model at p is 
the statistical model 

Sp = {e^-'^^W -p-.ue Dom {Kpf , Ep [u] = o} . 

The function 

Bp D Dom (Kp)° 3u^ e"-'^^^") ■peM(X,X, //) 

is the likehhood function when the 'model parameter' is u. 

We now have all the elements for the definition of the atlas. Let us consider the following 
map defined on a subset Vp of the proper domain Kp-. 

ep:Vp3u^ e"-^''(") • p E M(X, X, //), (5) 

where Kp{u) = logEp [e"] = log G'p('u) is the cumulant generating functional computed at u. 

This mapping is one-to-one because u is centered. According to (1) and (2) wc shall denote 
by Up the image of the mapping and by Sp its inverse on Up. Such an inverse, Sp : Up ^ Vp is 
easily computed, for q G Up, as 

(6) 



q 

Sp{q) = log - - Ep 



log 



p 
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The functions Sp, p e A4{X, X, /i), will be the coordinate mappings of our manifold in the sense 
that, locally around each p e M.{X,X,ii), each q &Up will be "parameterized" by its centered 
log-likelihood. 

Let us compute now the change-of-coordinates formula: if pi and p2 are two points in 
M.{X, X, /i) such that Up^ fl Up^ ^ 0, then the composite (transition) mapping 



where the algebraic computations are done in the space of |U-classes of measurable functions 
and the expectation is well defined as long as nZipj because this implies u + log ^ e Vp^. 

Theorem 3 The collection of pairs {{Up,Sp) : p G A4{X, X , n)} is an affine -atlas on 
M.{X, X , n). The induced topology on sequences is equivalent to e-convergence. 

Definition 4 (Exponential manifold) The exponential manifold is the manifold defined by 
the property in theorem 3 on the set Ai{X, X, /i). 

The manifold structure we have defined is a special one: many other types of atlases have 
been suggested in the literature, in particular the mixture coordinates and the so-called Amari's 
imbeddings described in [1]. 

In the infinite dimensional case those different geometric structures are not equivalent to 
the exponential manifold, but in some restricted sense they are, because they induce the same 
manifold structure on finite dimensional sub-manifolds (i.e. parametric statistical manifolds). 

The maximal exponential model already defined has a precise place in the general framework. 
In fact the maximal exponential model Sp is the connected component containing p of the 
exponential manifold A4{X, X, fi). 

In the previous works on the differential geometric approach to nonlinear filtering and to the 
finite dimensional approximation of the Fokker-Planck equation (Brigo, Hanzon and LeGland 
[9], [10] and Brigo [13]) we used the L2 structure to project the Kushner-Stratonovich or 
the Fokker-Planck equation onto a finite dimensional exponential manifold of densities. This 
procedure uses the map p 1— > from positive densities to their square roots as a tool which 
allows the L2 structure to enter the picture. Although this is useful to perform computations, 
and even if this approach yields the same finite dimensional approximation as in the case where 
one projects according to the exponential manifold structure discussed here (compare formulae 
given in section 4 with formulae obtained via the L2 structure given in [13]), we notice that 
this map cannot be used to define a manifold structure. It does not yield charts. This is due 




simplifies to 



o (u) = u + log u + log — 
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to the fact that any open set of L2 contains functions which are negative in a set with positive 
measure. Then we see that a chart should map open sets in the manifold onto open sets in L2, 
but these open sets would contain the functions described above, and hence they could not be 
contained in any set of square roots of densities (which are positive everywhere). This is why 
the space of square roots of densities cannot have a manifold structure based on L2. In [10] this 
problem is bypassed by defining a parametric exponential enveloping manifold. Here we use 
the exponential manifold structure to render the procedure rigorous in an infinite dimensional 
context. 

Now we show the properties of the map from M. to L2 defined hy R : p ^ y^. 
Proposition 5 The mapping 

R: M3p^ y/pe L'^iii) 
is C°° . If the tangent space is identified with Bp then its tangent map is 

TpR{v) = ]^R{p)v. 
In particular the tangent map is surjective at any p. 

Proof. Let us fix a density po and consider the coordinate form of the map R\ it is defined 
from to -^2(^0) by Hp^{u) :— ^Je^Ju). By direct computation one obtains the form of the 
directional derivative 

-^Hp^iu) = Hp^{u){l[v - DKp^{u)v]) = Hp^{umv - E^^^^^^v], 

and the norm of the differential operator 

\\D Hp,{u) v\\l = lE,,^iu){{v - ^e,„(.)^)'} = lD'Kp^{u){v,v), 

which shows that Hp^ is differentiable at u since Kp^ is infinitely Prechet differentiable. The 
coordinate- free form of the differential is TpRv — \^Jpv if we identify the tangent space at 
p with Bp. Note that the L2 norm of this first derivative represents a variance, so that the 
derivative operator is one-to-one. 

In a similar way the other derivatives of Hp^ can be computed as: 

Hp^{u){vi, ...,Vn)= 2-"i/po(M)(vi - ^epo(u)'yi, ■■■,Vn- E^^^^^)Vn). 

If one computes the L2 norm of this derivative one finds easily that the norm is bounded and 
the differentiabihty of any order follows. □ 

Hence the mapping p ^ is what wc call a regular parametrization. From the exponential 
coordinates u we deduce Hp^{u) which can be differentiated. Yet ifp^, although constituting a 
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parametrization, does not define coordinates. This is due to the fact that we are working in 
infinite dimension and a regular parametrization is not necessarily a chart. 

A basic object of the manifolds theory is the tangent bundle. In the case of the exponen- 
tial manifold it has been remarked from the very beginning of the geometrical theory that 
there is a very natural identification between the tangent vectors and the exponential one- 
dimensional models around a point p. In fact each differentiable curve in Ai{X, X, n), i.e. each 
one-dimensional statistical model p{t), t E I R, such that p(0) = p, has a tangent model 
of the exponential form e*"-^p(*") . p. This can be rephrased by saying that these exponential 
one-dimensional models seem to play the role of straight lines. 

Definition 6 (Tangent space) The tangent space Tp at p of the exponential manifold on 
J^{X, X , fj,) is the set (indexed by u) of the one-dimensional exponential models 



p 



Usually we will identify the tangent exponential model e*" -f^pM . p with its score 

d 



^^{tu-Kp{tu)) 



u e Bp 



t=o 

The tangent space inherits the structure of Banach space from Bp. 

Definition 7 (Sub-manifold, sub-model) Let M he a subset of the exponential manifold 
Ai{X, X, /i) and, for each density p e J\f, let Vp and Vp be closed subsets of Bp, such that there 
exist: 

1. a linear invertible and bi- continuous mapping between Bp and some direct sum Vp -{-Vp. 
That is Vp and Vp split in Bp. 

2. a chart on a neighbourhood Wp of p: 

: >Vp ^ V^ + V^, 

where Up maps Wp onto the product of to open sets Vp x Vp and Af nWp onto Vp x {0}. 
We will say that M is a sub- model or a sub-manifold of the exponential manifold A^(-^, X, /i). 

A sub-manifold is a manifold defined by the restricted maps. For a hst of examples see [22]. 
Our basic example is a finite dimensional exponential family 

EM{c)^{p{;9),9ee}, 
p{;e) :=exp[^^c(-)-V'(^)], 
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where c— (ci, . . . , c„), and is a convex open set in M^. In this case the local representation 
at p{-,9o) is 



3 Evolution of marginal laws of a diffusion process 

On the complete probability space JF, P) let us consider a stochastic process {X^, t > 0} of 
diffusion type. Let the dynamic equation describing X be of the following form 



where {Wt,t > 0} is a standard Brownian motion independent of the initial condition Xq. The 
equation above is an Ito stochastic differential equation. In the following derivation, we treat 
the scalar case. The following set of assumptions will be in force throughout the paper. 

(A) Initial condition: We assume that the initial state Xq has a density po w.r.t. the Lebesgue 
measure on M, with po almost surely positive. 

(B) Local strong existence: / G C^'^, a G C^'°, which means that / is once continuously 
differentiable wrt x and continuous wrt t and a is twice continuously diffcrcntiable wrt x 
and continuous wrt t. This assumptions imply in particular local Lipschitz continuity. 

(C) Non-explosion : there exists K > such that 



for alH > 0, and for all x E IR. 

Under assumptions (A), (B) and (C) there exists a unique solution {Xt , i > 0} to the state 
equation, see [27], theorem 10.2.1 with (j){x,t) — x'^. 

(D) We assume that the law of Xf is absolutely continuous and its density Pt{x) has regularity 
C^'^ and satisfies the Fokker-Planck equation (FPE): 



9) = exp[{9 - 9of[c{-) - ^'(^o)] - (m - ^(^o)) + {9- ^o) V(^o)]p(-, ^o), 



and the relevant splitting is 




dXt^ft{Xt)dt + at{Xt)dWt 



2xft{x)+at{x) <K{l+\x\^), 



(7) 
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where the backward diffusion operator Ct is defined by 



dx ^ dx'^ ' 
and its dual (forward) operator is given by 

We assume also Pt{x) to be positive for alH > and almost all x E IR. 

Assumption (D) holds for example under conditions given by boundedness of the coefficients / 
and a plus uniform ellipticity of at, see [27] theorem 9.1.9. Different conditions arc also given in 
[16], theorem 6.4.7. Now we rewrite equation (7) in the exponential coordinates (6). Consider 
as local reference density the solution pt of FPE at time t. We are now working around pf. 
Consider a curve around pt corresponding to the solution of FPE around time t expressed in 
Bp^ coordinates: 

(-e,e) ^ Vp, 

h ^ SpXpt+h) -■ Uh- 

The function Uh represents the expression in coordinates of the density 

Pt+h = exp[uh - KpXuh)]pt =■ ehPt- (8) 
Now consider FPE around t, i.e. 

dpt+h _ 

Substitute (8) in this last equation in order to obtain 
Write 

dh Pt 

and set h — 0, since we are concerned with the behaviour in t. Notice that Cq = exp[iio — 
Kp^{uo)] = exp(O) — 1, and that 

deh. _ f d[uh - Kp^ {uh)] _ d[uh - Kp^ {uh)] , 
^[.=0 - {en ^ }[.=o - [.=0. 
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Moreover, by straightforward computations (write explicitly the map Kp^, use Uh — Sp^{pt+h) 
and differentiate wrt h under the expectation Ep^) one verifies 



dKp^{uh) 



dh 

so that 

duh 



= 0, 



h=0 Pt 



dh 

is the formal representation in exponential coordinates of the tangent vector in pt. Notice that, 
again by straightforward computations, 

af.^atipt)^^ = _/,^(logp,)-^+ (10) 
Pt ox ox 



^2 Q 

+ W'^tQ^i^ogpt) + at {-^{logpt jf + 



dat d , d'^at , 

Summarizing: consider the curve expressing FPE around pt in Bp^ coordinates. Its tangent 
vector is given by at- Under suitable assumptions on the coefficients ft and at the function at 
belongs to Bp^, according to the convention that locally identifies the tangent space of normed 
spaces with the normed space itself. To render the computation not only formal we need at to 
be really a tangent vector for our manifold structure. This in turn requires the curve t ^ pt 
to be differentiable. Below we give a regularity result expressing a condition under which this 
happens and whose proof is immediate. Moreover, we give a condition which can be used to 
check whether the evolution stays in a given submanifold. 

Proposition 8 (Regularity and finite dimensionality of the solution of FPE) 

(i) If the map t ^ pt is differentiable in the manifold Ai then at given in eq. (10) is a tangent 
vector. 

(ii) If the map t at is continuous at to into _Lcosh--i^ ^j^^^ ^ ^ differentiable at to as 
a map into M.. 

(Hi) Let be given a submanifold H such that po e J\f. If the previous condition is satisfied and 

qp 
p 

is tangent to M at p for all p e N , then pt evolves in M . 
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Sufficient conditions under wfiicfi condition (ii) in tfie proposition fiappens to be true are 
given by boundedness for all possible T > of /, d^f, a, d^a, d^^a in [0 T] x ^ plus classical 
assumptions given in Stroock and Varadhan [27] theorem 9.1.9, or Friedman [16], theorem 
6.4.7, ensuring existence of a regular solution or Fokker-Planck equation (as required in (D)). 
This follows from the fact that if at{x) is continuous and bounded in both t and x, then it is 
continuous as a map t ^ at from [0 T] to ^cosh -i 

In the following we give examples where this proposition applies. Some of them are obtained 
from [13] where the detailed derivation is given. 

Example 9. [Linear case] If ft{x) = FfX for all t > 0, x E IR {f linear in x) and if at{x) = At 
for all t > 0, X E IR {a does not depend on x) and if finally po ~ A/'(mo, Qq) then it is known 
that pt ~ J^{rnt, Qt) where mt = mocxp /q Fgds and Qt is the (unique) positive solution of the 
(scalar Lyapunov) equation 

Qt^'iFtQt + At, 

with initial condition Qq given. Consider now a generic Gaussian density p ~ J\f{m, Q) and 
compute 

,CfP., . .Ft At , 9 ,Ftm Atfn. Atw? ^ At , , 

When applied to pt, the previous formula yields at- 



Qt 2QV ^ Qt g? ' 2Q1 2Qt' 

where rut and Qt have been defined above. 

In this case the previous proposition applies. First, one sees that 1 1— > is indeed continuous 
at any Iq in L^osh — i Secondly, one can deduce already from (11) without solving the Fokker- 
Planck equation that the solution will have a Gaussian density. Indeed, one can easily check that 
the tangent space to the Gaussian submanifold of Ai expressed in B coordinates contains the 
function space span{l, x, x^}. Since by expression (11) we see that (>Cjp) /p lies in span{l, x, x^} 
for all p in the Gaussian submanifold, we deduce that the solution of the Fokker-Planck equation 
will evolve in the Gaussian submanifold. 

Example 10. [Nonlinear diffusions with unit variance Gaussian law] Let be given a diffusion 
coefficient at{x) satisfying assumptions (B) and assumption (C) when the drift vanishes, i.e. 
when f — (we set as usual a :— a^). In [13] it is shown that by defining the drift 



Mx) := i^(^) + lat{x)[kt - x] + k, 



the Fokker-Planck equation for the density of the solution of the stochastic differential equation 

dXt = MXt)dt + at{Xt)dWt Xo~Ar(0,l), 
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is solved by pt ~ N{kt, 1) for all possible diffusion coefficients at{x). Here the solution of the 
Fokker-Planck equation evolves in a submanifold of M. given by Gaussian densities with unit 
variance. Actually, the mean of pt evolves linearly in time and the variance is fixed to one. Note 
that in this case 

at = dt \ogpt = k{x - kt), 

and the curve t ^ at is clearly continuous at any to in ^cosh — i q^^^ might check a priori that 
if a given p belongs to the submanifold of Gaussian densities with unit variance, then (C^p) /p 
belongs to the tangent space of this submanifold if the mean is given by kt. Indeed, we are 
considering the family 

p{x, 0)^^ exp[-i(x - 9f] ^ Af{9, 1), 9 em, 

and its tangent space expressed in -Bp(.,6)) coordinates, span{a; — 9}. Let us compute 

at,0{x) = l{d^at{x)) {kt -9)+ ^at{x){x - 9){kt - 9) + k{x - 9). 

Under reasonable assumptions on a, this function belongs to the tangent space spanja; — 9} 
if and only if = kt. We have been able to check that the density of the diffusion X evolves 
according to pt ~ Af{kt, 1) without solving the Fokker-Planck equation. 

^From examples given in [13] one can construct other nonlinear cases where the above 
proposition applies. 

4 Projection of the Fokker-Planck equation 

In reaching equation (9) wc assumed implicitly a few facts. Wc arc assuming that there always 
exists a neighborhood of h = such that in this neighborhood pt^h G Up^. Conditions under 
which this happens will be examined in the future. Wc only remark that when projecting on 
a finite dimensional exponential manifold, these conditions are not necessary for the projected 
equation to exist and make sense, sec below. Neither we need equation (9) to have a solution to 
obtain existence of the solutions of the projected equation. Now we shall project this equation 
on a finite dimensional parametrized exponential manifold. First notice that according to propo- 
sition 8 the law pt remains finite dimensional if there exists a finite dimensional parametrized 
submanifold of M., 

S^{p(;9),9ee}, 

such that the corresponding tangent vectors at{p{-,9)) of the FPE are in the tangent space 
of this finite dimensional submanifold. We take the set open in M"^. If we look for a finite 
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dimensional exponential family, we can select a submanifold 

EMic) = {p{;9),9ee}, 



We will assume the following on the family EM{c) (sec [9], [10] for other more specific assump- 
tions) : 

(E) We assume c e C^. 

Notice that tangent vectors around a point 9t) of a generic curve h i— > 9t+h) on EM{c), 
are now obtained according to (after straightforward computations and the chain rule) 

Qj^ \h=o - 2^[Ci[-) - Eo^Ci\y^. {12) 

As a consequence, the tangent space at 9 is given by 

TgEM{c) = Span{ci(-) - EqCu C^(-) - EgCm}, 

where Eg{(f)} := / (f){x)p{x,9)dx. Consider the following inner product in TgEM{c) (and more 
in general in Bp(^.fi)): 

{vi, V2)e := Eg[vi Vi], Vi, V2 e -Bp(.^e). 

Consider the quantities 

g{9)ij := (q(-) - Egd, Cj{-) - EgCj)g, i, J = 1, . . . ,m. 

Notice that the matrix g{9), expressing the inner products of tangent vectors in TqEM{c), is 
nothing else than the traditional Fisher information matrix 

( Ee [deAogp{-,9) de.logp{-,9)] )i,j=i,...,m 

for the family EM{c) (see also [9], [10]). Now define for all 6* e the orthogonal projection 

: TgEM{c) 

m m 

n^H := EiEa'^iO) {v,cj{-) - Eoc,)e] (q(-) - Eec,). 
i=i j=i 

A rapid computation involving duality between C and C* and standard results on the normal- 
ization constant ip{9) of exponential families (such as dg^ip{9) = EgCi) yields 

m^^^T^] - EoiUf g-\9) [c{.) - Eec], 
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where integrals of vector functions are meant to be applied to their components. Note that 
this map is regular in 6 under reasonable assumptions on /, a and c. At this point we project 
equation (9) via this projection. By remembering expression (12) for tangent vectors and the 
above formula for the projection we obtain the following (m-dimensional) ordinary differential 
equation (in vector form) on the manifold EM[c): 

[c(.) - EeA^Ot = EeXCtcY g-\dt) [c{-) - EeA- (13) 

It follows immediately the following ordinary differential equation for the parameters: 

Ot = g-\et) EeACt c}. (14) 

Notice that, as anticipated above, equation (14) is well defined and admits locally a unique 
solution if the following condition (ensuring existence of the norm of at{p{-,6t)) associated to 
the inner product (• , •)6)J holds: 

(F) Ee{ale} < oo e 6, (15) 

+>t^i0-c)+a,{l{9-c)r + 
dx dx dx"^ 

We will assume such condition to hold in the following. Notice that this is a condition on the 
coefficients /, a, c. We have thus proven the following 

Proposition 11 ( Projected evolution of the density of an Ito diffusion) Assume as- 
sumptions (A), (B),(C), (E) and (F) on the coefficients f,a, on the initial condition Xq of the 
ltd diffusion X, and on the sufficient statistics ci, . . . ,Cn of the exponential family EM{c) are 
satisfied. Then the projection of Fokker-Planck equation describing the evolution of pt — pxt 
onto EM{c) reads, in Bp^ coordinates: 

[c(-) - Ee,cf9t = EeACcf g~\9t) [c(-) - E,,c], 

and the differential equation describing the evolution of the parameters for the projected density- 
evolution is 

Ot = g-\et) EeXLt c}. 

Notice that the projected equations exist under conditions which are more general than condi- 
tions for existence of the solution of the original Fokker-Planck equation. For more details see 
Brigo [13]. 
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5 Interpretation of the projected density as density of a 
different diffusion 



In this section we shortly expose a problem which was treated in [13]. Consider the projected 
density p{-,9t), expressing the projection of the density-evolution of the one dimensional dif- 
fusion X onto the exponential manifold EM{c) . The question is: Can we define a diffusion Yf 
whose density is the projected density p{-,Ot) ? If the answer is yes, Yt is a diffusion whose 
density evolves in a finite dimensional exponential manifold assigned a priori (for example 
Gaussian). In order to proceed, define a diffusion 

dYt ^ Ut{Yt)dt + at{Yt)dWt, Yo^Xo, (16) 

with the same diffusion coefficient as Xt. We shall try to define the drift u in such a way that 
the density-evolution of Yf coincides with p{-, 9t). Call % the backward differential operator of 
Yt: 



dx ^ dx'^ 

Consider the right hand sides of (9) and (13). Clearly, the density of Y^ coincides with p{-, 0^) if 

^-^f^ = E,J£cr g-\9t) H-)-EeA 
Pv, t't) 

which we can rewrite as 

r*p{;et)^Vt,e,p{;et). 

By simple calculations one can rewrite the above equation as the following PDE for u, where 
we do not expand the second partial derivative of atp{-, 9): 

dUt „T "^C 1 <9^ , , n\\ ^ 

Call Btfi^ the right hand side of such equation. A solution is given by 

/X 
Bt,et{y)exp[ejc{y)]dy, 
-oo 

as one can verify immediately by substitution. Straightforward calculations yield 

p{x,et) J -oc' p{y,et) p{y,0t) 

+n,{^i^^^}M,,^,)d,= 

= ^,^ix) + lat{x)ej^{x) + 



-EoAUVg-\Ot) r {c{y)-Ee,c) eM^I {c{y) - c{x))]dy . 

J —oo 
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^Prom this last equation one sees that under condition (15) and under the assumption that 
densities of EM{c) are integrable, the above integral always exists. 
We have thus proven the following 

Proposition 12 (Interpretation of the projected density-evolution) Assume assump- 
tions (A), (B), (C), (E) and (F) on the coefficients f,a and on the initial condition Xq of the 
ltd diffusion X and on the sufficient statistics c of the exponential family EM{c) are satisfied. 
Letp{-,9t) be the projected density evolution, according to proposition 11. Define 

dYt = u;{Yt)dt + at{Yt)dWt, 

i^(a;) + |a,(x)^r^(x) + 

-EeACtcY9'\et) r {c{y)-Ee,c) eM^U^y) - c{x))]dy. 

J —oo 

Then Y is an ltd diffusion whose density-evolution coincides with the projected density-evolution 
p{;9t) ofXt onto EM{c). 

6 Further research on convergence of the projected den- 
sity towards the original one 

Now wc consider the problem of the convergence of the projected density-evolution to the 
evolution of the limit diffusion. The idea is to consider a sequence of nested finite dimensional 
families and to check what happens when the dimension of the family on which the equation 
is projected tends to infinity. The problem we shall investigate in the future is the following. 
Suppose we can define a sequence of families in the following way: 

(G) we are given a sequence of functions (cj)-,ejv- Call c"* :— {ci C2 ...Cm}, and assume 
that for all m the family EM[c"^) is a finite dimensional exponential manifold satisfying 
assumptions (E) and (F). Call p{-,6^) the density coming from projection of Fokker- 
Planck equation onto EM[c^). 

As we saw in the preceding section (see (17)) this is also the density of a diffusion process 
with the same initial condition, the same diffusion coefficient and drift given by 
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Hence, if we prove that converges in law towards the original diffusion X, we have a first 
convergence result of the projected evolution towards the original one. Now, it is well known that 
since all the diffusions share the same diffusion coefficient, under reasonable assumptions 
on a the sequence of the laws of {Y'^)m is relatively compact in the space of processes with 
continuous trajectories (see for example Stroock and Varadhan [27] or Bafico and Pistone [3]). 
Moreover, if the drifts li"* weakly converge to /, the law of will converge to the law of 
X. By looking at expression (18), one sees intuitively that this should happen. Indeed, assume 
p{-,6^"') admits a limit p{-) when m tends to infinity, and suppose the projection tends to be 
exact when m tends to infinity. Then in formula (18) replace p{-,0"^) by p and eliminate the 
projection operators. The expression for Ut{x) simplifies to ft{x), so that we have pointwise 
convergence of the drifts u"^ towards / and wc arc done. Of course, one needs to make the 
above idea precise, to show reasonable choices of c™ and to prove the result rigorously. These 
problems will be investigated in the next future. 
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